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Related Application 

The subject matter of this application is related to the subject matter in a 
20 co-pending non-provisional application by inventor Daniel ManHung Wong, filed 
on the same day as the instant application entitled, "Reforming Queries to 
Selectively Audit Accesses to Rows Within a Relational Database," having serial 
number TO BE ASSIGNED, and filing date TO BE ASSIGNED (Attorney 
Docket No. OR00-01101). 



Field of the Invention 

The present invention relates to providing security in computerized 
30 databases. More specifically, the present invention relates to a method and an 
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apparatus for selectively auditing accesses to relational database tables based upon 
auditing conditions. 

Related Art 

Databases commonly store highly sensitive data, such as salaries, 
corporate financial data, and even classified military secrets. For security reasons 
it is essential to be able to audit accesses to this sensitive data. Conventional 
database systems typically provide a general auditing facility that records an audit 
trail containing general information about the user and the query issued. 

However, conventional auditing facilities have a number of shortcomings. 
They do not record specific information about the application, the session 
environment or most importantly, the query results. Consequently, information 
gathered by a conventional auditing facility is frequently insufficient to 
reconstruct an event, or even to determine whether access rights have been 
violated. 

In conventional relational database systems, auditing facilities only record 
information regarding which tables are accessed, not whether certain rows inside a 
given table are accessed. This table-level auditing tends to generate a large 
number of false audit records because many accesses to a given table do not touch 
sensitive data. 

What is needed is an auditing mechanism that can specify a finer 
granularity of audit conditions during accesses to relational tables in order to 
minimize the number of false audit records that are generated. 

Another problem in auditing database accesses arises in distributed 
database architectures, in which an application located on an application server 
sends a query to a database located on a database server. In this type of 
distributed architecture, auditing is typically performed by embedding customized 
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auditing mechanisms into the application on the applications server, not at the 
database server. Relying on the application to perform auditing can give rise to 
many problems because a large number of applications can potentially access the 
database. Consequently, it is almost impossible to ensure that each one of these 
applications is configured to perform the auditing properly. 

What is needed is an auditing mechanism for database accesses which 
does not rely on applications outside of the database server to perform auditing. 

SUMMARY 

One embodiment of the present invention provides a system that 
selectively audits accesses to a relational database system. This system starts by 
receiving a query from a client at a database server. The system processes this 
query at the database server to produce a query result. The system also creates an 
audit record for rows in relational tables that are accessed by the query, and that 
satisfy an auditing condition. Next, the system records the audit record in an audit 
record store and returns the query result to the client. Integrating the auditing 
facility into the relational database system in this manner ensures that auditing is 
performed in the same way regardless of which application generates the query. 
Furthermore, this auditing is transparent to applications and users. 

In one embodiment of the present invention, the system additionally 
modifies the query so that processing the query causes the audit record to be 
created and recorded for rows in relational tables that are accessed by the query 
and that satisfy the auditing condition. In a variation on this embodiment, the 
auditing condition is associated with a table in the relational database system. 

In one embodiment of the present invention, the auditing condition 
includes a plurality of auditing conditions. 
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In one embodiment of the present invention, the audit record includes a 
user name for a user making the query, a time stamp specifying a time of the 
query, and a text of the query. 

In one embodiment of the present invention, the system determines if 
auditing is enabled, and if so, creates the audit record. 

In one embodiment of the present invention, auditing is enabled on a table 
by table basis. 

In one embodiment of the present invention, the audit record is created 
while the query result is being generated. 

In one embodiment of the present invention, the audit record is created 
after the query result is generated. 

In one embodiment of the present invention, the auditing condition 
includes a condition for a field within the relational database system. 

BRIEF DESCRIPTION OF THE FIGURES 

FIG. 1 illustrates a distributed computing system in accordance with an 
embodiment of the present invention. 

FIG. 2 illustrates a table within a relational database in accordance with an 
embodiment of the present invention. 

FIG. 3 is a flow chart illustrating the process of auditing a query in 
accordance with an embodiment of the present invention. 

FIG. 4 is a flow chart illustrating the process of reforming a query for 
auditing purposes in accordance with an embodiment of the present invention. 

DETAILED DESCRIPTION 

The following description is presented to enable any person skilled in the 
art to make and use the invention, and is provided in the context of a particular 
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application and its requirements. Various modifications to the disclosed 
embodiments will be readily apparent to those skilled in the art, and the general 
principles defined herein may be applied to other embodiments and applications 
without departing from the spirit and scope of the present invention. Thus, the 
5 present invention is not intended to be limited to the embodiments shown, but is 
to be accorded the widest scope consistent with the principles and features 
disclosed herein. 

The data structures and code described in this detailed description are 
typically stored on a computer readable storage medium, which may be any device 

10 or medium that can store code and/or data for use by a computer system. This 
includes, but is not limited to, magnetic and optical storage devices such as disk 
drives, magnetic tape, CDs (compact discs) and DVDs (digital video discs), and 
computer instruction signals embodied in a transmission medium (with or without 
a carrier wave upon which the signals are modulated). For example, the 

15 transmission medium may include a communications network, such as the 
Internet. 

Distributed Computing System 

FIG. 1 illustrates a distributed computing system 100 in accordance with 
20 an embodiment of the present invention. Distributed computing system 1 00 
includes a number of computer systems (nodes), including clients 102-104, 
application server 108 and database server 1 10. Computer systems 102-104, 108 
and 110 can generally include any type of computer system, including, but not 
limited to, a computer system based on a microprocessor, a mainframe computer, 
25 a digital signal processor, a personal organizer, a device controller, and a 
computational engine within an appliance. 
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Computer systems 102-104, 108 and 1 10 are coupled together by a 
computer network (not shown). This computer network can include any type of 
wire or wireless communication channel capable of coupling together computing 
nodes. This includes, but is not limited to, a local area network, a wide area 
network, or a combination of networks. In one embodiment of the present 
invention, this network includes the Internet. 

Clients 102-104 can include any node on a network including 
computational capability and including a mechanism for communicating across 
the network. In the embodiment of the present invention illustrated in FIG. 1, 
clients 102-104 communicate with application 107 located on application server 
108. Application 107 in turn communicates with relational database 109 on 
database server 110. Application server 108 and database server 1 10 can include 
any node on a computer network including a mechanism for servicing requests 
from a client for computational and/or data storage resources. Note that 
application server 108 serves as a client for database server 110. 

Database server 1 10 contains relational database 109. Relational database 
109 can generally include any type of database system designed around relation 
tables. Within relational database 109 there exist a number of mechanisms, 
including query processor 120 and auditing mechanism 122. Query processor 120 
operates on a number of relational tables 113, including table 1 14 and table 116. 
Auditing mechanism 122 records audit records to audit record store 118. 

Note that tables 1 13 and audit record store 1 13 reside on storage device 
1 12, which is controlled by database server 1 10. Storage device 1 12 can include 
any type of non- volatile storage device, such as non- volatile storage devices based 
on magnetic, optical and/or magneto-optical storage devices. Storage device 1 12 
can also include non-volatile semiconductor storage devices based on flash 
memory or battery-backed up random access memory. Note that copies of 
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selected portions of tables 1 13 and audit record store 1 18 can also exist within 
memory in database server 110. 

Table Structure 

FIG. 2 illustrates the structure of table 1 14 within relational database 119 
in accordance with an embodiment of the present invention. Table 1 14 has the 
same structure as a typical table in a relational database, including a number of 
rows containing various fields. 

Table 1 14 also includes a row-level auditing flag 206 which indicates 
whether auditing is enabled for the table. In another embodiment of the present 
invention, the system includes a database-wide row-level auditing flag, which 
enables row-level auditing for the entire relational database 109. 

Table 1 14 is additionally associated with a number of auditing conditions, 
including auditing condition 202 "salary > 1,000,000" and auditing condition 204 
'title = 'CEO"\ Auditing conditions 202 and 204 instruct relational database 109 
to create an audit record for any rows that are accessed within table 1 14 that 
include a salary field with a value greater than 1,000,000, or that include a title 
field specifying a CEO. Accesses to other rows in table 1 14 that do not satisfy 
either of these auditing conditions do not cause an audit record to be generated. 

Process of Auditing a Query 

FIG. 3 is a flow chart illustrating the process of auditing a query in 
accordance with an embodiment of the present invention. The system starts by 
receiving a query 123 at relational database 109 within database server 110 from 
application 107 within application server 108 (step 302). 

The system first determines if auditing is enabled by checking all tables 
referenced by the query to see if an auditing flag is set for the tables (step 303). If 
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auditing is not enabled, the system processes query 123 as usual to produce a 
query result 124 (step 305), and then returns query result 124 (step 314). 

If auditing is enabled, the system modifies the query by inserting 
monitoring logic into query 123 (step 304). This monitoring logic causes an audit 
record to be created and recorded for any rows that satisfy an auditing condition. 

In one embodiment of the present invention, query 123 is modified by 
inserting statements into query 123 to make query 123 call a function that creates 
and records auditing records if a row in a table satisfies the auditing conditions. 

In another embodiment of the present invention, modifying query 123 
involves creating two separate queries. A first query additionally includes 
restrictions based on the auditing conditions. This query produces audit records 
only for rows that satisfy the auditing conditions. A second query is unmodified 
from the initial query and is used to produce the query result. 

In one embodiment of the present invention, the auditing condition is 
specified for the entire relational database 109. In another embodiment, the 
auditing condition is specified on a table-by-table basis. 

Next, the optimization layer performs view merging (step 305), and the 
system processes query 123 to produce the query result 124 (step 306). In doing 
so, the system creates an audit record for rows that are accessed by query 123 and 
that satisfy the auditing condition (step 3 10). Alternatively, the audit record can 
be produced by running a second query at a later time to produce the audit record. 

Note that the audit record includes a user name for a user making the 
query, a time stamp specifying a time of the query, a text of the query and specific 
variable bindings for the query. 

The system then records the audit record in audit record store 118 (step 
312) and then returns the query result 124 to application 107 (step 314). 
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Process of Reforming a Query 

FIG. 4 is a flow chart illustrating the process of reforming a query for 
auditing purposes in accordance with an embodiment of the present invention. 
The system starts by receiving a query 123 at relational database 109 (step 402). 
The system next retrieves any auditing conditions that might exist for tables 
involved in query 123, such as auditing conditions 202 and 204 associated with 
table 1 14 (step 404). 

If query 123 includes a statement that modifies a table within relational 
database 109, such as a delete statement, an insert statement or an update 
statement, the system uses pre-existing triggers in the database system to create 
audit records for these statements (step 405). Note that a database system 
generally maintains a log of changes to database tables, so providing an auditing 
mechanism for these changes in addition to the log may be unnecessary. The 
system then processes query 123 to produce query result 124 (step 418). 

If query 123 includes a select statement, the system appends a case 
statement to query 123 for each auditing condition using a view expansion 
mechanism (step 406). For example, suppose that the auditing condition is DEPT 
= 'SALES' and that query 123 is, 

SELECT * FROM payroll WHERE salary> 150,000; 

The system appends a case statement to the select statement as follows. 

SELECT * FROM 
( 

SELECT * FROM payroll 

WHERE (CASE WHEN (dept= 'sales') THEN SYS_AUDIT() 
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ELSE NOT NULL END) IS NOT NULL; 

) 

WHERE salary > 150,000 

In this example, the case statement will cause the S YS_AUDIT() function to be 
called if the auditing condition is satisfied. This S YS_AUDIT() function causes 
an audit record to be created and recorded. 

Note that the function S YS_AUDIT() always returns NOT NULL (unless 
an error condition arises within S YS_AUDIT()). Therefore the predicate is 
always evaluated as TRUE. Hence, the predicate does not affect the logic of 
query 123, and the predicate can be safely applied to query 123. 

Next, the query processor is allowed to allocate buffers for query 123 (step 
408). After the buffers are allocated, the system removes the case statement from 
query 123 (step 410). In one embodiment of the present invention, this involves 
moving a query filter list data structure that refers to the predicate to a temporary 
location in the query block data structure that contains all the meta-information 
for query 123. Moving the predicate in this way prevents the optimizer from 
reorganizing the CASE clause and query plan layer to allocate row source for the 
CASE clause. 

After optimization, the system generates a query plan for query 123 to 
specify an order in which the operations involved in query 123 are carried out 
(step 412). 

After the query plan is generated (but before generating the execution plan 
for any "GROUP BY" clause that may exist) the system generates an execution 
plan for the case statement so that the case statement is evaluated only for rows 
that satisfy all the preceding conditions in query 123 (step 414). In this way, an 
audit record is created only for rows are accessed by query 123. 
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Next, the system processes query 123 to produce query result 124 (step 418). This 
causes an audit record to be created (step 420) and recorded (step 422) for each 
row that satisfies the conditions of query 123 as well as the auditing conditions. 

Finally, the system returns query result 124 to the entity that issued query 
5 123 (step 424). 

The foregoing descriptions of embodiments of the invention have been 
presented for purposes of illustration and description only. They are not intended 
to be exhaustive or to limit the present invention to the forms disclosed. 
Accordingly, many modifications and variations will be apparent to practitioners 
10 skilled in the art. Additionally, the above disclosure is not intended to limit the 
present invention. The scope of the present invention is defined by the appended 
claims. 
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